A hierarchical statistical model to assess the confidence of peptides and proteins inferred from tandem mass spectrometry

نویسندگان

  • Changyu Shen
  • Zhiping Wang
  • Ganesh Shankar
  • Xiang Zhang
  • Lang Li
چکیده

MOTIVATION Statistical evaluation of the confidence of peptide and protein identifications made by tandem mass spectrometry is a critical component for appropriately interpreting the experimental data and conducting downstream analysis. Although many approaches have been developed to assign confidence measure from different perspectives, a unified statistical framework that integrates the uncertainty of peptides and proteins is still missing. RESULTS We developed a hierarchical statistical model (HSM) that jointly models the uncertainty of the identified peptides and proteins and can be applied to any scoring system. With data sets of a standard mixture and the yeast proteome, we demonstrate that the HSM offers a reliable or at least conservative false discovery rate (FDR) estimate for peptide and protein identifications. The probability measure of HSM also offers a powerful discriminating score for peptide identification. AVAILABILITY The algorithm is available upon request from the authors.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Proteome analysis of Cryptosporidium parvum and C. hominis using two-dimentional electrophoresis, image analysis and tandem mass spectrometry

Until recently, Cryptosporidium was thought to be a single species genus. Molecular studies now showthat there are at least 10 valid species of this parasite. Among them, two morphologically identical species, C.hominis and C. parvum are the most pathogenic identified to date and share 97% of identical genomes.Post-genomic analyses is therefore necessary to explore further the...

متن کامل

Periplasmic expression of Bacillus thermocatenulatus lipase in Escherichia coli in presence of different signal sequences

Efforts to express lipase in the periplasmic space of Escherichia coli have so far been unsuccessful andmost of the expressed recombinant lipases accumulate in the insoluble cell fraction. To evaluate the role ofnative and heterologous signal peptides in translocation of the lipase across the inner membrane of E. coli,the lipase gene (btl2) was cloned downstream of the native ...

متن کامل

Quantification of Melittin in Iranian Honey Bee (Apis mellifera meda) Venom by Liquid Chromatography-electrospray Ionization-ion Trap Tandem Mass Spectrometry (LC-ESI-IT-MS/MS)

The current research aimed to quantify melittin (MEL) in Iranian honey bee (Apis mellifera meda) venom. To this end, a liquid chromatography-electrospray ionization-ion trap tandem mass spectrometry (LC-ESI-IT-MS/MS) approach was employed. Melittin is the main toxic peptide of honey bee venom with various biological and pharmacological activities. It was extracted with...

متن کامل

Database search algorithm for identification of intact cross-links in proteins and peptides using tandem mass spectrometry.

A new database search algorithm has been developed for identification of intact cross-links in proteins and peptides from tandem mass spectrometric data. Using this algorithm, intact cross-links can be identified and characterized in proteins and peptides with high confidence. The algorithm was tested using BS(3) (bis[sulfosuccinimidyl] suberate) cross-linked Cytochrome C. Five cross-links were...

متن کامل

Development and Evaluation of Methods for Predicting Protein Levels and Peak Intensities from Tandem Mass Spectrometry Data

Tandem mass spectrometry (MS/MS) of peptides is a central technology for proteomics, enabling the identification of thousands of proteins and peptides from a complex mixture. With the increasing acquisition rate of tandem mass spectrometers, it has become possible to use data-mining techniques to attempt to solve important biological problems using MS/MS data. These problems include (i) estimat...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Bioinformatics

دوره 24 2  شماره 

صفحات  -

تاریخ انتشار 2008